Poster: Alex.brollo Date: Mar 17, 2014 9:53am
Forum: forums Subject: How to manage books that are collections

Some books are "collections of works": they gather different works by the same author or by different authors. I'm uploading from Opal Libri Antichi, and I found that the source website follows a convention of splitting such books into individual PDFs. This means that the relationship between each individual work and the physical book (and its metadata, e.g. its title) is broken, leaving a series of differently titled items.

Would it be a good idea to merge those files into one, rebuilding the "physical book", and then list the individual works it contains in the item's description field? It complicates automation a little, but perhaps it improves the quality of the uploads.

Poster: Jeff Kaplan Date: Mar 17, 2014 10:25am
Forum: forums Subject: Re: How to manage books that are collections

probably. typically we advise that when files share the same metadata (title, author, etc.) they be on a single item page. so if a book is broken into multiple pdfs i'll assume it is one book and put it in a single item.

not sure you have to rebuild the book so much as name the files in a logical manner and upload them all to a single item.
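
One possible way to name the files "in a logical manner" is a zero-padded sequence number followed by a short slug of each work's title, so that the files sort in book order inside the item. This is only a sketch: the function name `item_filenames`, the `prefix` parameter, and the naming pattern are assumptions for illustration, not an Internet Archive convention.

```python
def item_filenames(titles, prefix="collection"):
    """Return order-preserving PDF names for the works bound in one book.

    Hypothetical naming scheme: <prefix>_<NN>_<slug>.pdf
    """
    # Pad to at least two digits so that 10 sorts after 09.
    width = max(2, len(str(len(titles))))
    names = []
    for index, title in enumerate(titles, start=1):
        # Replace anything that is not a letter or digit with a hyphen...
        slug = "".join(ch.lower() if ch.isalnum() else "-" for ch in title)
        # ...then collapse runs of hyphens and trim them from the ends.
        slug = "-".join(part for part in slug.split("-") if part)
        names.append(f"{prefix}_{index:0{width}d}_{slug}.pdf")
    return names
```

The same ordered list of titles could also be pasted into the item's description field, so the link between the physical book and its individual works stays visible.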

Poster: Alex.brollo Date: Mar 20, 2014 10:19am
Forum: forums Subject: Re: How to manage books that are collections

My reference source (Opal) provides two-page images, so uploading requires splitting and cropping (which has been automated). So if I can automate, via an additional parameter, the deletion of the duplicated pages at the beginning of the PDF files to be merged, the work is done. Unfortunately, I can't simply upload the original set of PDF files, since each of them needs splitting/cropping (and often the removal of some pages at the beginning of individual PDF files).
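
The two steps described here can be sketched in plain Python, independent of any particular PDF or imaging library. The sketch assumes the spread is split exactly down the middle and that only the files after the first one carry duplicated leading pages. Both are assumptions, and the names `spread_crop_boxes`, `merged_page_plan`, and `skip_leading` are hypothetical.

```python
def spread_crop_boxes(width, height):
    """Return crop boxes (left, upper, right, lower) for the two halves
    of a two-page spread, assuming the gutter is at the exact middle."""
    middle = width // 2
    return (0, 0, middle, height), (middle, 0, width, height)


def merged_page_plan(page_counts, skip_leading):
    """List the (file_index, page_index) pairs to keep when merging PDFs.

    The first file is kept whole; every later file drops its first
    `skip_leading` pages (the extra parameter mentioned in the post).
    """
    plan = []
    for file_index, count in enumerate(page_counts):
        start = 0 if file_index == 0 else min(skip_leading, count)
        plan.extend((file_index, page) for page in range(start, count))
    return plan
```

The boxes use the (left, upper, right, lower) ordering common to imaging libraries, and the plan can then drive whatever PDF tool actually performs the merge.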

Following a personal "rule", I upload the original PDF file or files in a sourceFiles.zip file in the item folder, so that the source is preserved for any future use or derivation.
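
A minimal sketch of that packing step, using only the Python standard library. The function name `pack_sources` is hypothetical, and storing the files without compression is an assumption (PDFs are usually compressed already).

```python
import zipfile
from pathlib import Path


def pack_sources(pdf_paths, archive_path="sourceFiles.zip"):
    """Bundle the untouched source PDFs into a single zip archive."""
    # ZIP_STORED: no recompression, the files are copied in as-is.
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_STORED) as archive:
        for path in pdf_paths:
            # Store only the file name, not the local directory structure.
            archive.write(path, arcname=Path(path).name)
    return archive_path
```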

Poster: Jeff Kaplan Date: Mar 21, 2014 3:32pm
Forum: forums Subject: Re: How to manage books that are collections

fwiw you can specify that the bookreader be one-up by default so that a single image (in your case it looks like a spread) appears. to do this you would add this metadata pair to the item:
bookreader-defaults=mode/1up
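
For scripted uploads, the same pair could be set with the third-party `internetarchive` Python package. This is a sketch, not an official recipe: it assumes the package is installed and credentials are already configured, and the helper names `bookreader_metadata` and `apply_to_item` are hypothetical.

```python
def bookreader_metadata(mode="1up"):
    """Build the metadata pair from the post: bookreader-defaults=mode/1up."""
    return {"bookreader-defaults": f"mode/{mode}"}


def apply_to_item(identifier, mode="1up"):
    """Write the pair to an existing item (needs network access and credentials)."""
    # Imported lazily so bookreader_metadata() works without the package.
    from internetarchive import modify_metadata

    return modify_metadata(identifier, metadata=bookreader_metadata(mode))
```

Calling `apply_to_item` with an item identifier of your own would then send the metadata change to that item.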

Poster: jrosenberg Date: Nov 20, 2014 7:24pm
Forum: forums Subject: Re: How to manage books that are collections

You might try using Briss to split/crop the PDFs:
http://briss.sourceforge.net/

Poster: Alex.brollo Date: Mar 21, 2014 3:37pm
Forum: forums Subject: Re: How to manage books that are collections

Thanks! I found 15 items coming from Opal, uploaded by other users and so far not assigned to the opallibriantichi collection:


The first one is a "double page" item and... just now I see that the djvu and jp2.zip files are perfectly split!!!

Silly me... this greatly simplifies the upload scripts and greatly lowers the load on the Tool Labs servers. :-(

I'll thoroughly review the whole job.

Poster: Alex.brollo Date: Mar 21, 2014 4:16pm
Forum: forums Subject: Re: How to manage books that are collections

No, I took a closer look: the pages are not split in the djvu file nor in the jp2/jpg zip files, and splitting is absolutely needed to proofread the text.

So splitting before uploading seems necessary, unless I'm mistaken again.

Poster: Jeff Kaplan Date: Mar 22, 2014 8:06am
Forum: forums Subject: Re: How to manage books that are collections

splitting them would certainly be better.